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Abstract 

Data were reanalyzed from a previously reported study 
of the passage dependence of reading comprehension 
questions being developed for the revised SAT. The ob- 
jective was to uncover any gender differences in ap- 
proaches to and performance on a task requiring exam- 
inees to answer reading comprehension questions 
without reading the passages on which the questions 
were based. Verbally able males and females were com- 
pared with respect to their reported use of a variety of 
test-taking strategies involving reasoning, personal 
knowledge, and guessing. A few relatively small (and 
often inconsistent) differences were detected between 
male and female test takers. However, far more similar- 
ities than differences were noted with respect to both 
test performance and test-taking behavior. If these re- 
sults are generalizable to more typical tests of verbal 
ability, they would seem to suggest that males and fe- 
males employ quite similar approaches to standardized 
test taking. 


Introduction 

Gender differences in performance on measures of aca- 
demic ability and achievement have generated much 
speculation and considerable research over the years, es- 
pecially for tests on which the stakes are high. To ac- 
count for the disparities, a variety of social, biological, 
educational, and psychological explanations have been 
offered (see Wilder Sc Powell, 1989, for example), and 
many specific factors have been posited as contributing 
to test score differentials. Among the possibilities that 
have been considered are genetic makeup (e.g., Benbow 
& Stanley, 1981), educational background/experience 
(e.g., Pallas & Alexander, 1983), personality traits and 
cognitive styles (e.g., Hassmen Sc Hunt, 1994), method 
of measurement (e.g., Bolger Sc Kellaghan, 1990), char- 
acteristics of test questions (e.g., Becker, 1990), and ap- 
proaches to both problem solving and test taking (e.g., 
Ben-Shakhar & Sinai, 1991; Gallagher, 1992; Johnson, 
1984). Test-taking approaches, for instance, may vary 
with respect to the propensity to guess in the face of un- 
certainty, the extent to which a slow and deliberate style 
is used, and the extent to which such traits as test anx- 
iety and testwiseness influence performance. 

Undoubtedly, more attention has been devoted to 
gender differences in mathematical ability/achievement 
than in other domains, such as verbal reasoning, where 
the differentials are much smaller and sometimes 
nonexistent. For instance, on the basis of a comprehen- 


sive synthesis of studies of performance on various 
kinds of verbal ability measures, Hyde and Linn (1988) 
were willing to assert that “there are no gender differ- 
ences in verbal ability, at least at this time, in American 
culture, in the standard ways that verbal ability has 
been measured,” and Feingold (1988) maintained, in 
both the text and title of his article, that “Cognitive 
gender differences are disappearing.” Apparently, how- 
ever, not everyone agrees. Halpern (1989), for instance, 
responded to Feingold’s claim by noting that 
whether “we find gender differences depends on what, 
who, and when (in the life span) we test” (p. 1 157). 

Significance of Research on 
Gender Differences 

Assuming that gender differences on measures of acad- 
emic ability and achievement are indeed small, we might 
appropriately ask “Why study the possible causes of 
such differences at all?” The answer, according to 
Wilder and Powell (1989), is that “There are real, quan- 
tifiable educational and social consequences of test per- 
formance. Even small differences can add up to major 
effects in the aggregate. Slight shifts in the ratio of male 
to female superiority in a domain can alter the nature 
of the population that qualifies for special awards, 
scholarships, programs, and educational opportunities” 
(p. 31). 

The relevance of this argument is apparent when 
examining trends in the SAT verbal scores of male and 
female test takers. The mean verbal score of females had 
been higher than that of males until the mid-1980s, 
when the average score of males first surpassed that of 
females (Burton, Lewis, & Robertson, 1988). Halpern 
(1989) noted the impact of this reversal — a drop in the 
number of female students receiving prestigious schol- 
arships on the basis of high SAT scores. As Stanley, 
Benbow, Brody, Dauber, and Lupkowski (1992) have 
observed, even seemingly negligible mean differences 
can be associated with “sizable [differences in] odds” at 
the upper end of the score distribution (p. 47). 

Previous Research and Further 
Hypotheses 

Recently, Powers and Wilson (1993, in press) adminis- 
tered a highly unconventional verbal reasoning rest — 
answering SAT reading comprehension questions 
without the availability of the passages on which the 
questions were based — to a sample of verbally able high 
school juniors. The study was designed to evaluate the 
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reading comprehension questions being developed for 
the revised SAT, specifically, to evaluate the degree to 
which performance depends on having read the pas- 
sages with which the questions are associated. A second 
objective was to determine the construct relevance of 
any strategies that students might use when they resort 
to answering questions without actually reading the 
passages. The study was partly in response to a recent 
revival of criticism that, because test takers can, to some 
degree, answer questions correctly without consulting 
the reading passages, the verbal portions of such tests as 
the SAT are not valid measures of reading comprehen- 
sion (Katz, Lautenschlager, Blackburn, &c Harris, 
1990). 

Although unique, the task had some similarities to 
the kind of ill-structured, unfamiliar reasoning prob- 
lems that are encountered in real life (N. W. Burton, 
personal communication, January 4, 1993). The task 
also had another interesting feature. Withholding the 
reading passages was, in a sense, equivalent to stripping 
away all relevant context, i.e., creating an exercise that 
was virtually totally decontextualized (and perhaps as 
Mwauthentic as we could possibly imagine). As Gold- 
stein (1994) has pointed out, there is considerable evi- 
dence that the context in which a test question is em- 
bedded can appreciably alter its difficulty. Linn (1992) 
suggested that differential performance by males and fe- 
males on various college admission tests was also re- 
lated to the context in which reasoning was tested. 
Others have attempted to explain the relevance of con- 
text to gender differences by suggesting that females are 
more attuned to contextual clues than males are. For 
example, Goldberger (1992) asserted that “Far more 
than men, women and girls emphasize the importance 
of care, connection, and context as central to thought 
and problem solving. Men, more often than women, 
emphasize abstract principles and universal solutions in 
decision making” (p. 140). 

A reasonable prediction stemming from this state- 
ment is that males should outperform females on a 
reading comprehension task for which the text has been 
removed. Based on a very small (N=2) informal survey, 
however, it was clear that alternative forecasts can be 
advanced with just as much conviction. For example, 
Graduate Record Examinations Executive Program Di- 
rector Charlotte Kuh predicted that because of their su- 
perior inferencing skills, females would perform better 
than males. Because they read more literally than fe- 
males, males would be at a greater disadvantage when 
not having the passages to reference (personal commu- 
nication, October 18, 1994). Educational Testing Ser- 
vice (ETS) Research Scientist Ann Gallagher, on the 
other hand, speculated that males would perform better 


because of their greater confidence and their willingness 
to “wing it” in the face of uncertainty (personal com- 
munication, November 3, 1994). In any event, as Mes- 
sick (1994) has noted, the usefulness of context depends 
on which features examinees respond to, and, in partic- 
ular, the relevance in the present study of characteristics 
such as degree of concreteness or ambiguity to the con- 
struct being measured. 

Objectives 

The objective of this follow-up study was to compare 
males and females with respect to their test perfor- 
mances when the reading passages were not accessible, 
and also to determine any differential tendencies to use 
particular test-taking strategies or to consider particular 
test characteristics in this unusual test-like task. 

Among the questions of interest were the following: 

1. Are males and females equally likely to use a 
“process-of-elimination” strategy? Do they rule 
out the same number of alternatives per item? 

2. Do males and females attend to the same features 
of answer choices? Do they use these features in 
the same way (i.e., to select versus to rule out 
choices)? 

3. Do males and females differ in the kinds of 
guessing (random, informed, patterned, or intu- 
itive) in which they engage? 

4. Are males and females equally inclined to invoke 
personal knowledge or experience? Does this in- 
clination differ according to the content of 
reading passages? 

5. Do males and females resort to various reasoning 
strategies with the same degree of frequency? 

These types of comparisons have the potential to 
shed additional light on how test-taking styles may con- 
tribute to gender differences in performance on tradi- 
tional multiple-choice tests. 


Method 

Data collected in a previously reported study were re- 
analyzed. The specific procedures used in the study are 
described in some detail elsewhere (Powers & Wilson, 
1993, in press). It will, however, be useful here to sum- 
marize some of the study’s salient features. 

Six reading passages and the questions associated 
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with them were selected from among a larger pool that 
had been pretested for the revised SAT. The nature of 
these passages and the number of questions associated 
with each one were as follows: 

1. A passage of approximately 900 words on lan- 
guage, in which the author, a Japanese Amer- 
ican, recounts an experience he had just after the 
United States entered the Second World War (12 
questions). 

2. A 500-word passage adapted from an excerpt of 
a memoir written by Elizabeth Bishop about the 
poet Marianne Moore (6 questions). 

3. An 800-word passage about Clarence Darrow 
and the Communist trial of 1920 (9 questions). 

4. A 600-word passage that presents a theory about 
the nature of the object that exploded above 
Tunguska in 1908 (9 questions). 

5. Two passages totalling about 800 words that 
present two views of the architectural design of 
cities. One discusses planned, medium-sized 
cities; the other offers a critique of modern cities 
(13 questions). 

6. A 500-word passage excerpted from a book of 
literary criticism analyzing the work of Richard 
Wright (1908-1960) (5 questions). 

Three different test forms, each consisting of ques- 
tions concerning two different passages, were assembled 
and administered to volunteering students at eight sec- 
ondary schools in seven states. Classroom teachers dis- 
tributed the tests in a spiral fashion so that approxi- 
mately one-third of all students (N=350) took each 
form. After attempting the task, examinees were asked 
to reveal how they had approached it — for example, the 


TABLE 1 


SAT Scores of Study Sample by Gender 


Male 

M S.D. 

Female 

M S.D. 

d 

SAT-V 

552 

86 

5.54 

89 

.21 

Reading 

55.0 

8.8 

5.5.4 

9.4 

.18 

Vocabulary 

54.1 

9.1 

52.7 

9.2 

.15 

SAT-M 

640 

9,5 

578 

100 

,65* 

TSWE 

51.7 

5.8 

52.5 

7.2 

-.08 


Note: N = 122 males and 146 females for whom SAT records were available. 
d = effect size, i.e., difference between means divided by the pooled wirhin- 
groups estimate of the standard deviation. 

‘p<.001 


extent to which they employed different kinds of rea- 
soning strategies, the degree to which they invoked per- 
sonal knowledge or experience, the extent (and kind) of 
guessing they used, and the particular features of an- 
swer choices (e.g., specificity, length, abstractness, etc.) 
that were considered to either select or rule out options. 
Information on the extent to which examinees elimi- 
nated answer alternatives was also collected during the 
test. 

The following comparisons are based on 335 study 
participants whose sex was either known from SAT files 
(N=268) or inferred from first name (N=67). The sex of 
15 students could not be determined because no SAT 
records could be located and because first name (e.g., 
Robin, Lee, BJ) did not allow a trustworthy inference. 
These 15 subjects were therefore not included in the 
analyses. 

Results 

Description of Sample 

SAT records were available for 80 percent of the 
sample. Table 1 compares performances on the SAT 
and on the Test of Standard Written English (TSWE) 
for males and females in the study sample. Table 2 


TABLE 2 


Characteristics of Study Sample 

by Gender 



Male 

Female 


OS = 122) 

fN = 146) 

Honors English (%| 

55 

52 

High School Rank {%) 

Top tenth 

54 

27 

Second tenth 

20 

15 

Second fifth 

10 

12 

Third fifth 

5 

7 

Fourth fifth 

0 

0 

Fifth fifth 

0 

0 

Unavailable 

52 

59 

High School GPA (%1 

27 

28 

A+ 

18 

14 

A 

20 

14 

A- 

11 

18 

B 

24 

26 

C 

<1 

0 

D, E, or F 

0 

0 

Unavailable 

27 

28 
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shows comparisons with respect to high school achieve- 
ment (grade-point average and rank in class} and course 
participation (honors English course). As the tables 
show, students in the sample were extremely capable, 
on average, in terms of each of these indicators. Except 
for the significantly better average performance of males 
on the SAT mathematical section (SAT-M), there were 
no large differences between males and females in the 
study on any of several indicators of academic ability. 

Test Performance Without 
Reading Passages 

As Table 3 shows, females performed significantly 
better (p<.05) than males on the passage about Richard 
Wright. This particular question set differed from the 
other five included in the study mainly in the extent to 
which personal knowledge was invoked when an- 
swering the questions, as is shown later. The effect size 
(of nearly .5) is considered “medium” according to one 
widely used standard (Cohen, 1977). There were no sig- 
nificant gender differences on any of the other five ques- 
tion sets, nor was there any tendency for males or fe- 
males to perform better overall. One of our reviewers 
suggested that the failure to detect gender differences in 
performance on the task may have been due at least in 
part to the overall difficulty of the task, resulting in a 
relatively restricted range of performance by both males 
and females. 

At the level of individual questions, significant 
ip<.05) between-group differences in question difficulty 


TABLE 3 


Test Performance (Without Passages) by Gender 

Test Form/Passage 


Males 

Females 

d 

A Language 

M 

3.13 

3.18 

-.03 

(N = 12 Items) 

S.D. 

1.93 

1.6.5 


A Marianne Moore 

M 

1.66 

1.89 

-.18 

(N = 6 Items) 

S.D. 

1.30 

1.25 


B Clarence Darrow 

M 

3.37 

3.54 

-.10 

(N = 9 Items! 

S.D. 

1.78 

1.67 


B Tunguska 

M 

3. .38 

3.17 

.25 

(iV = 9 Items) 

S.D. 

1.73 

1.57 


C Architecture 

M 

3.96 

3.52 

.29 

= 13 items) 

S.D. 

1.78 

1,39 


C Richard Wright 

M 


; 3.19 

-.48* 

(iV = 5 Items) 

S.D. 

1.27 

1.05 


Note: N = 53 males, 71 females for Form A; 52 males, 52 females for Form B; 

and 49 males, 58 females for Form C. 




d = effect size, i.e., difference berween means divided by the pooled within- 

groups estimate of the standard deviation. 




*p<.05 






were detected for 4 of the 54 questions on all forms — 
about what would be expected by chance. All were in 
favor of males, and 3 were based on the Tunguska pas- 
sage. The correlation of question difficulty for males 
with that for females (with respect to a simple rank or- 
dering of difficulty) was .79, .74, and .93 for test forms 
A, B, and C, respectively. 

Use of Various Test-Takmg 
Strategies 

Tables 4, 5, and 6 show the frequencies, by test form, 
with which male and female test takers used each of a 
variety of strategies related to reasoning (or inference), 
personal knowledge, and guessing. Overall, there were 
few statistically significant differences between the gen- 
ders, and no consistent differences across test forms for 
any of the strategies. For test form C, females were less 
likely than males to report that, for a large proportion 
of questions, they had either “Chose[n] an answer be- 
cause it seemed to be consistent with something stated 
in the other questions” or “Guessed a particular 
choice.” Females were also more likely than males to 
state that they recognized the Marianne Moore passage 
or knew its source. Each of these differences can best be 
described as approaching “medium” in effect size. 

Use of Features of Answer 
Choices 

Table 7 shows that, with respect to the use of 26 pos- 
sible answer-choice features to select answers, male and 
female test takers differed significantly on only two. 
They differed with regard to the use of three features for 
eliminating answer choices (Table 8). The only consis- 
tency was that males had a greater tendency, compared 
to females, to select choices they regarded as “outdated 
or old-fashioned,” and to eliminate choices that were 
seen as “in tune with current thinking.” Table 9, which 
displays the mean number of answer choices eliminated 
by males and females, does not suggest any differential 
tendency for males or females to rule out alternatives as 
implausible. 

Consistency in Use of Strategies 
by Gender 

It is evident from the various tables discussed previously 
that there were few detectable differences between male 
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TABLE 4 


Use of Reasoning Strategics by Gender 


Strategy 

Test 

Form 

Males 

Females 

d 

Chose an answer because it 
seemed to be consistent with 
something stared in the 
other questions 

A 

2.75 

2,68 

.08 


B 

2.79 

2.79 

.00 


C 

1.67 

2.24 

.42* 

Tried to determine the meaning 
of a word, or phrase, or the way 
in which it was used, from the 
ocher questions in the set 

A 

2.70 

2.55 

.14 


B 

2.58 

2.56 

.02 


C 

239 

2.69 

-.31 

Ruled out an answer because it 
seemed to contradict something 
in the other questions 

A 

2.42 

2.28 

,12 


B 

2.66 

2.50 

.14 


C 

2.35 

2.47 

-.12 

Assumed, guessed, or knew the 
answers to some questions and 
then, on the basis of these answers, 
reasoned what rhe answer to a 
later question would have to be 
(or what it could not be) 

A 

2.42 

2.31 

.11 


B 

2.46 

2.33 

.12 


C 

2.24 

2.24 

.00 

Chose an answer because it 
resembled something in the 
question: I associated a word, 
phrase, or idea in the question 
with something in the answer 1 
chose 

A 

2.10 

2.03 

,06 


B 

2.18 

2.32 

-.13 


C 

2.09 

1.88 

-.17 

Tried to reconstruct the theme 
or main idea of the missing 
passage by reading all the 
questions and answers 

A 

62% 

50% 

.23 


B 

57% 

52% 

.10 


C 

64% 

57% 

.14 


Note: Entries for rhe first five strategies are means on a 0 to 4 scale, with 
points indicating use of the strategy for few or no questions (0), about 25% of 
the questions (1), about half of the questions (2), about 75% of the questions 
(3), or all or nearly all questions (4). Entries for the sixth strategy are 
percentages of "yes” responses. 

d = effect size, i.e., mean tor males minus mean for females divided by the 
pooled within-groups standard deviation. 

><.05 


and female test takers with regard to the use of strate- 
gies to select answer choices. In fact, the degree of sim- 
ilarity in patterns of strategy use was far more apparent 
than were any dissimilarities. Table 10 reveals that the 
similarity between males and females with regard to the 
frequency of use of various strategies (correlations of 
the rank order of frequency of use) was generally very 
strong. For example (although not shown in Table 10), 
for test forms B and C, both males and females “Used 
vague hunches or intuition” more frequently than any 
other guessing strategy. “Guessed among two or more 
choices that couldn’t be eliminated” was the second 
most frequent, “Guessed randomly among all choices” 
was the third most frequent, and “Guessed a particular 


TABLE .5 


Use of Personal Knowledge by Gender 
Kind of Personal 


Knowledge Used 

Test Form/Passage 

Males 

Females 

d 

Knowledge abqut rhe 





topic learned in or 





outside of school 

A Language 

1,53 

1.52 

.01 


A Marianne Moore 

0.77 

0.99 

-.21 


B Clarence Darrow 

1,60 

1.42 

.14 


B Tunguska 

1.33 

1.12 

.16 


C Architecture 

1.02 

1.07 

-.04 


C Richard Wright 

1.84 

2.12 

-.20 

Recognized the 





passage or knew 





its source 

A Language 

4% 

4% 

-.02 


A Marianne Moore 

;o% 

«% 

-.40' 


B Clarence Darrow 

10% 

4% 

.23 


B Tunguska 

16% 

8% 

.25 


C Architecture 

2% 

0% 

.01 


C Richard Wright 

22% 

36% 

-.30 


Recognized the 
author and was 
familiar with his/her 


opinions 

A Language 

2% 

1% 

.04 


A Marianne Moore 

4% 

4% 

-.03 


B Clarence Darrow 

8% 

2% 

.28 


B Tunguska 

4% 

2% 

.10 


C Architecture 

2% 

0% 

.21 


C Richard Wright 

46% 

31% 

.31 


Note: Entries for "Knowledge about . . are means on a 0 to 4 scale, with 
points indicating use of the strategy for few or no questions (0), about 25% of 
rhe questions (1), about half of the questions (2), about 75% of the questions 
(3), or all or nearly all questions (4). 

For "Recognized the passage . . and "Recognized the author . . . the 
entries are percentages of “yes” responses. 

d = effect size, i.e., mean for males minus mean for females divided by the 
pooled within-groups standard deviation. 

><.05 


choice” was the least frequent guessing strategy used, 
resulting in a perfect rank order correlation of 1.00. The 
greater inconsistency between males and females in the 
extent to which they used personal knowledge was due 
mainly to the fact that females more frequently recog- 
nized the Marianne Moore passage. 

Discussion 

A variety of test-taking behaviors was examined in 
order to determine any gender differences in approaches 
to and performance on an unusual verbal reasoning 
task. With regard to these behaviors, we found no con- 
sistent differences between males and females in elimi- 
nating answer choices because they were implausible. 
Nor were there any consistent differences between the 
genders in their reported use of numerous characteris- 
tics of response alternatives either to rule out choices or 
to select them. This finding is significant, because a 
“process-of-elimination” strategy is one that is often 
strongly advocated by commercial coaching schools as 
a way to “beat” standardized, multiple-choice tests. 
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TABLE 6 


Use of Guessing Strategies by Gender 

Strategy 

Test Form 

Males 

Females 

d 

Guessed among two 
or more choices that 





couldn't he eliminated 

A 

l.U 

2.38 

-.14 


B 

2.29 

2.29 

-.01 


C 

2.41 

2.62 

-.20 

Used vague hunches 





or intuition 

A 

2.55 

2.38 

.14 


B 

2.08 

2,13 

-.05 


t; 

2.04 

2.14 

-.09 

Guessed randomly 
among ail choices 

A 

,73 

1,07 

-.35 


B 

.67 

.90 

- 27 


C 

.88 

1.05 

-.16 

Guessed a particular 





choice (e.g., A or C) 

A 

.47 

.41 

.07 

B 

.54 

.39 

.15 


C 


'.4} 

.40* 


Note: Entries are means on a 0 to 4 scale, with points indicating use of the 
strategy for few or no questions (0), about 25% of the questions (1), about 
half of the questions (2), about 75% of the questions (3), or ail or nearly all 
questions (4). 

d = effect size, i.e., mean for males minus mean for females divided by the 
pooled within-groups standard deviation. 

V<.05 


(This strategy is also typically mentioned in test famil- 
iarization materials provided by test sponsors as an ap- 
propriate basis for making informed guesses.) That no 
notable differences were found is also meaningful in 
light of speculation (e.g., Linn, 1992) that males and fe- 
males differ with respect to their willingness to take 
risks. (Insofar as eliminating the right choice is a distinct 
likelihood, this strategy could be a shaky one.) 

Some researchers have suggested that, when forced 
to guess, females may favor certain kinds of options 
over others, e.g., familiar, nontechnical answer choices 
over unfamiliar, technical ones (Strang, 1977). We 
found little evidence, however, of any differences in 
preference according to gender, aside from one curious 
(and inexplicable) finding — that relative to females, 
males may be more inclined to select answer choices 
they consider to be “outdated or old-fashioned” and 
to eliminate choices they regard as “in tune with cur- 
rent thinking.” If more than a chance finding, this re- 
sult could be due to different beliefs about how to 
outwit the test maker. Before much credence is placed 
on this isolated outcome, however, follow-up is re- 
quired. 

The extent to which personal knowledge was in- 
voked to answer questions seemed to be more a func- 
tion of the content of the passages/questions than of 
gender. Although there was some hint that, with respect 
to this type of strategy, males and females may differ ac- 
cording to the content of the reading passages, there 
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Frequency of Use of Various Answer-Choice Features to 
Select Answers by Gender 

Feature 

Males 
(N = 1S4) 

Females 
fN = 181) 

More carefully worded/qualified 

76% 

77% 

More concrete 

75 

65 

More defiime/absolute 

66 

63 

More specific 

64 

70 

At the center of all the other choices, 
something in common with all choices 

62 

62 

In tune with current thinking 

60 

66 

More positive/iess critical in lone/mood 

58 

62 

More complex 

5S 

52 

Most obvious 

48 

45 

More common/normal 

48 

45 

Least ambiguous 

44 

39 

More uncommon/unusual 

40 

36 

Longer 

39 

32 

More general 

38 

36 

Simpler 

37 

41 

Least obvious 

34 

45* 

Less definite/more relative 

32 

33 

Shorter 

32 

29 

More abstract 

30 

31 

Most ambiguous 

29 

29 

More negative/more critical in tone/mood 

29 

28 

Outdated, old-fashioned 

27 

IS** 

Too neutral 

26 

20 

Too similar to other choices 

25 

20 

Too extreme 

18 

18 

Less qualified 

14 

13 

•p<.05 

•*p<.01 


was little consistency in the findings. Nor did we detect 
any noteworthy differences between males’ and females’ 
use of several strategies that we regarded as illustrating 
reasoning. 

To summarize, in a highly able sample of secondary 
school juniors we found remarkably few differences be- 
tween males and females with respect to either perfor- 
mance on, or strategies used for, an unconventional 
reading task. The occasional differences that were noted 
were not consistent across the three test forms, sug- 
gesting that the few differences that were identified may 
have been more a function of the particular content of 
the questions (and the missing reading passages) than of 
the task itself. This inference is consistent with the fact 
that the passages (and questions) are based on a variety 
of subject matter that may have been differentially fa- 
miliar or interesting to male and female test takers. 
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TABLE 8 


Frequency of Use of Various Answer-Choice Features to 
Eliminate Choices by Gender 

Feature 

Males 
(N = 1S4) 

Females 
fN = 181) 

Less qualified 

75% 

69% 

Too extreme 

70 

66 

More abstract 



Too similar to other choices 



Less definite/morc relative 

64 


More general 

62 

59 

Too neutral 

60 

60 

More ncgative/more critical in tone/mood 

56 

56 

Outdated, old-fashioned 

55 

55 

Most ambiguous 

53 

45 

Simpler 

52 

49 

Least obvious 

52 

45 

Most obvious 

47 

49 

More uncommon/unusual 

47 

46 

More complex 

36 

43 

More common/normal 

36 

33 

More specific 

34 

33 

Least ambiguous 

34 

32 

Shorter 

33 

29 

More positive/less critical in tone/mood 

31 

22 

At the center of ail the other choices, 
something in common with all choices 

30 

1** 

More definite/absolute 

27 

29 

Longer 

27 

27 

More concrete 

22 

22 

In tune with current thinking 

Z2 

11** 

More carefully worded, qualified 

21 

14 

><.05 


Even more noteworthy than these infrequent differ- 
ences, however, were the considerable similarities found 
between the genders in their approach to and perfor- 
mance on the exercise. We found little consistent evi- 
dence to suggest that performance on the task differs by 
gender. Apparent differences in the ways in which males 
and females confronted the assignment were detected 
only about as frequently as might be expected by chance 
alone. In fact, the frequencies with which males and fe- 
males employed various strategies were extraordinarily 
similar. If any conclusion is to be drawn from these data 
then, it is that males and females are far more alike than 
they are different with respect to both performance on 
and strategies used for the task studied here. 

A possible limitation of the study, which may mod- 
erate the findings, is the low-stakes nature of the task 
that was presented. That is, test performance did not 


TABLE 9 


Use of an Answer Elimination 

Strategy by Gender 



Mean Number of Answer 
Eliminated 

Choices 

Test Form/Passage 

Male 

Female 

d 

A Language 

12.7 

14.3 

-.16 

A Marianne Moore 

5.0 

5.7 

-.12 

B Clarence Darrow 

10.4 

13.5 

-.38 

B Tunguska 

9.6 

11.0 

-.15 

C Architecture 

18.1 

15.1 

.27 

C Richard Wright 

7.2 

6.0 

.24 

Note: d = effect size, i.e., mean for males minus mean for females divided by 
the pooled within-groups standard deviation. 


“count” in any meaningful way for the subjects partici- 
pating in the study. Under operational conditions, in 
which test takers can be expected to be more motivated, 
more anxious, etc., the test-taking strategies studied 
here might be used differently by males and females. 
Still other strategies might also come into play. This 
speculation is perhaps moot, however, as this sort of 
task is unlikely to be encountered outside the context of 
a research study. 

There were several other limitations as well. One 
reviewer was skeptical about the accuracy of examinee 
reports on strategy use, and would have preferred an al- 
ternative means of gathering these data. The rationale 
for our choice (and a discussion of the trade-offs in- 
volved) has been presented elsewhere (Powers and 
Wilson, in press). Despite these limitations, the study re- 
sults are informative, we think, as further documenta- 
tion of how males and females approach standardized, 
multiple-choice tests. 


TABLE 10 


Correlations Between Males and Females of Rank Ordering 
of Frequency of Use of Various Strategies 




Test Form 

Category of Strategy 


A 

B C 

Reasoning 


.97 

.90 .55 

Guessing 


.95 

1.00 1.00 

Persona! knowledge: 




About the topic 

.94' 



Recognized the passage or it.s source 

.53- 



Recognized the author or his/her opinions 

.85' 



Use of answer-choice features to: 




Select answers 

.96' 



Eliminate choices 

.97" 



'Computed over all six passages. 




'Computed over all subjects, regardless of test 

form. 
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